PERCOLATION IN A HIERARCHICAL RANDOM GRAPH 
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Abstract. We study asymptotic percolation as N — » oo in an infinite ran- 
dom graph 5jv embedded in the hierarchical group of order N, with con- 
nection probabilities depending on an ultrametric distance between vertices. 
Gist is structured as a cascade of finite random subgraphs of (approximate) 
Erdcfe-Renyi type. However, the results are different from those of classical 
random graphs, e.g., the average length of paths in the giant component of an 
ultrametric ball is much longer than in the classical case. We give a criterion 
for percolation, and show that percolation takes place along giant compo- 
nents of giant components at the previous level in the cascade of subgraphs 
for all consecutive hierarchical distances. The proof involves a hierarchy of 
"doubly stochastic" random graphs with vertices having an internal structure 
and random connection probabilities. 



1. Introduction 

Random graphs have been used to analyze percolation in some infinite systems 
(e.g., [3 [7] and references therein). On the other hand, hierarchical structures 
have been used in applications in physics, biology (in particular, genetics), etc., 
where an underlying ultrametric distance plays a basic role (e.g., [T31 fT5l ITt5I l2"fj] 
and references therein). Hence it is natural to consider infinite random graphs 
embedded in hierarchical structures, with the probability of connection between 
two vertices depending on an ultrametric distance between them. In this paper we 
study percolation in an infinite random graph, Gn, embedded in an ultrametric 
group, r2jv, called hierarchical group of order N, as N — > oo. The structure of 
Gn is a cascade of (approximate) Erdos-Renyi random subgraphs at consecutive 
hierarchical distances, and this allows using results from the classical theory of 
random graphs [U \17\ [TBI [22] . However, we use them only as a technical tool. As 
we shall see, for Gn there are different results from the classical ones. An essential 
feature of Gn is that, due to the tree-type form of f2jv, any path connecting 
two vertices must contain an edge (or 1-step connection) of size equal to the 
hierarchical distance between them. In order to prove that percolation occurs 
along giant components in the cascade at consecutive distances, we introduce 
a family of random graphs whose vertices have an internal structure involving 
giant components at the previous hierarchical levels. In these "doubly stochastic" 
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random graphs the connection probabilities between vertices are random variables 
which arc highly dependent, but asymptotically deterministic as N — ► oo. There 
are papers that consider random graphs with connection probabilities depending 
on a distance between vertices or some function of pairs of vertices (e.g., [5l [8l [TOl 
[13]), and/or involving hierarchical structures or ultrametrics (e.g., [31 [8l [20l [211 
E21 EH ESI EZ] ) , but we have not seen the setup we consider here. 

Definition 1.1. The hierarchical group of order N (integer > 2) is defined as 



with addition componentwise mod N, i.e., Qn is the direct sum of a countable 
number of copies of the cyclic group of order N. The hierarchical distance on Ojv 
is given by 



Note that d(-, •) is translation-invariant, and it is an ultrametric, i.e., it satisfies 
the strong (or non- Archimedean) triangle inequality: for any x, y, z, 



and for each integer k > 1, a k-ball in f2jv (i.e., a set of points at distance at most 
k from each other) contains N k points, and it is the union of N (k — l)-balls which 
are at distance k from each other (a 0-ball is a single point). 

The group £In has been used as a state space for stochastic models in several 
applications (e.g., [15] and references therein). 

We are interested in studying percolation properties on f2jy, and to this end we 
consider a random graph Qn whose vertex set is f2jv, with connection probabili- 
ties depending on the hierarchical distance between points. We parameterize the 
connection probabilities so as to characterize the critical regime for percolation. 

The random graph Gn is defined as follows. 

Definition 1.2. We define an infinite random graph Qn with the points of f2jv as 
vertices, and for each k > 1 the probability of connection between x and y such 
that g?(x, y) = k is given by 



where Ck is a positive constant (independent of N) , all connections being indepen- 
dent. 

In this paper we do not investigate percolation for fixed N, we obtain a result 
for large N, i.e., asymptotic percolation as N — > oo, by means of the theory of 
random graphs. This method is natural for the framework of the hierarchy of 
random graphs which constitutes Qn- 

Recall that in the classical theory of Erdfls-Renyi (ER) random graphs Q{N,p) 
with TV vertices and connection probability p, the right parameterization for exis- 
tence of giant components as N — > oo is p — c/N, c > 1. Analogously, the choice 
of connection probabilities (|l.lj) will allow us to characterize percolation in Qn as 
N — > oo in terms of the sequence (c^) with all Ck > 1. One of our main tools 



f2jv 



{x = (x u x 2 , ■ ■ ■) : Xi G {0, 1, . . . ,N - 1}, i = 1, 2, . . . , ; 

Xi only for finitely many i}, 




d(x,y) < max{ci(x,z),(i(z,y)}, 
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is the ER theory, which we will apply recursively along consecutive hierarchical 
distances. Since a fc-ball has N k points, the ER theory would require connection 
probabilities Ck/N k ,Ck > 1. The fact that percolation in Gn can occur with con- 
nection probabilities of the form (jl.ip is due to the internal hierarchical structure 
of fc-balls. We shall see in Section 3.2 that the average length of shortest paths 
between randomly chosen points in the giant component of a fc-ball is of order 
(log-/V) fc , which should be compared with klogN in the classical case. 

To begin, we observe why the theory of ER random graphs can be used to 
investigate Gn- By two different (k— l)-balls in a given fc-ball are connected 

within the fc-ball with probability 

N 2(k-1) 

P Nk = l-(l-J^T) , *>1, P W,1 = |- (1-2) 

Hence p N > k ~ Ck/N as N — > oo. More precisely, using the elementary inequality 



0< -y 



1-1 



. 2 2 

m \ y 



< ( — j y, 0<y<n, m > 2, (1.3) 



n 

with y = Cfc, n = N 2k ~ 1 , m — iV 2 ^ fc_1 \ we have from ((T 

JV.fc _ C fc , J 1 \ „„ Ar „JV,1 _ C l 



Two (fc— l)-balls in a fc-ball can also be connected through points outside the /c-ball. 
However, as we shall see, this does not add more than o(l/N) to the probability of 
connection between the (k— l)-balls. On the other hand, we are interested in what 
happens inside the fc-ball. Therefore we will disregard connections through points 
outside the fc-ball. By (|1.4|) . for any k > 1, a fc-ball with its N (k — l)-balls as 
vertices is a random graph Q(N,p ' ) which approximates the ER random graph 
G(N, Ck/N) as N — > oo. Note that (jl.ip is equal to Ck/N with a norming which is 
the product of the sizes of two (k — l)-balls. (See Remark 2.4). 

The tree representation of fl^ (See Figure 1) shows Gn as a cascade of random 
subgraphs G(N,p N ' k ), k = 1, 2, . . . , where the form (1.4) of the connection proba- 
bilities is inherited throughout the hierarchy Note that the hierarchical distances 
k are the same for all N. 

We consider asymptotic percolation in Gn, meaning that, as N — > oo, there 
exists a path going from € f2jv to infinity, i.e., to points at arbitrarily large hier- 
archical distances, with positive probability. This is made precise in the following 
definition. 

Definition 1.3. We say that there is asymptotic percolation in Gn if 

Pperc '■= haf liminf P[0 G f2jy is connected by a path to a point at distance k] 

k N^oo 

>0. (1.5) 



In Theorem 12.21 we give a criterion for asymptotic percolation in Gn, and we 
show that percolation occurs along giant components of giant components at the 
previous level in the cascade of subgraphs G(N,p N ' k ) for all hierarchical distances 
k. This involves replacing the connection probabilities p N,k , given by (II. 2p . by 
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J-ball 




9 graphs Q(N, p NJ ), 3 graphs Q(N, p"- 3 ), 1 graph Q(N, p" 1 ) 



Figure 1 . Tree representation of f2jy 



random connection probabilities between giant components, which generates a 
hierarchy of doubly stochastic random graphs. Although we use the basic results on 
the sizes of components in ER random graphs, most of the work in the proof of the 
theorem consists in formulating the problem in such a way that those results can 
be employed to obtain upper and lower bounds for the probability of percolation. 
Another part of the proof consists in showing that percolation occurs along the 
cascade of giant components at consecutive hierarchical levels. We will also discuss 
the possible form of percolation paths, and comment on approximate scale-free 
degree sequences in the graphs G{N, Ck/N) for large k. 

In a final section we make some heuristic comments regarding doubly stochastic 
random graphs of the type used for the proof of Theorem [272] and their application, 
in particular referring to average distance and central limit theorem for the cascade 
of giant components. These random graphs give rise to questions that suggest 
further research which would be of independent interest. 
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2. Asymptotic percolation and cascade percolation 

We will use the following fundamental result on ER random graphs Q(N, c/N) 
(see, e.g., Theorem 5.4; recall that "a.a.s." for a graph property means that 
the probability that the random graph possesses the property tends to 1 as N — ► oo 
[22]): 

(A) If c > 1, then a.a.s. G(N, c/N) has a unique giant component, its size is f5N , 
where (3 € (0, 1) satisfies j3 = 1 — e~ c/3 , and all the other components have sizes at 
most log TV. 

For each k > 1 and each fc-ball, we have a random graph G(N,p N ' k ) whose 
vertices are the N (k— l)-balls contained in the /c-ball, with connection probability 
pN,k gjygjj by (ji 2p . We assume that c k > 1 for all k, hence, because of (II ,4|) and 
(A), each G(N,p N,k ) has a unique giant component a.a.s.. We may assume that 
N is large enough so that the giant components have emerged in all fc-balls for 
all k. Note that, by ([O]) . p N ' k > q N > k /N, where q N ' k = c k (l - c k /2N) > 1 for 
N > c\/2(ck — 1), therefore unique giant components emerge in G(N,p N,k ) for 
all k as N — > oo. Since percolation involves c k — > oo as k — > oo, our method is 
restricted to the limiting situation N — > oo. 

If S 1 is a non-empty set of vertices of G(N,p N ' k ), we denote by 11511 the number 
of its elements (hence < ||5|| < N), and by |5| the number of points of Ojv 
contained in S, thus |5| = ||5||7V' £ - 1 . 

It is reasonable to consider percolation with connections along giant components 
in the subgraphs G(N,p N ' k ) at consecutive hierarchical distances k. Indeed, it 
may happen that is in the giant component of the 1-ball it belongs to, and 
that this 1-ball is in the giant component of the 2-ball it belongs to, but it may 
also happen that is not in the giant component of the 1-ball it belongs to, 
but that it is connected to the giant component of the 2-ball, which may still be 
good for percolation. However, by (A), the second possibility is negligible with 
respect to the first one as N — > oo. The same argument can be made for the 
vertices of the subgraphs G{N,p N ' k ) and any two consecutive values of k. In 
addition, if y( fc ) denotes the number of 1-step connections from a point to points 
at distance k from it, we shall sec that P[y( fc ) > 0] - c k N- {k -^ as N -> oo for 
each k > 1, so, the probability of connections at distance k is negligible w.r.t. the 
probability of connections at distance k — 1, and all external connections from a 
A:-ball will be at distance k + 1 from it for all k as N — ► oo. Therefore we will 
look at asymptotic percolation along giant components at consecutive distances 
k. Moreover, we will consider asymptotic percolation along the cascade of giant 
components of giant components at the previous level for all hierarchical distances 
k. This precisely means that is in the giant component of the 1-ball it belongs 
to; this giant component is connected to other giant components of 1-balls in the 
2-ball in they belong to, forming a level 2 giant component; and so on for every 
hierarchical distance k. For brevity, we call cascade percolation this special form 
of percolation. 

In order to study cascade percolation, for each k > 1 and large N we consider 
a random graph Gn,u whose vertices are the (k — l)-balls that are the vertices of 
G(N,p N ' k ), but now each vertex has an internal structure which involves all the 
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components in the j-balls, j = 1, . . . , k — 2, in the (k — l)-ball. Clearly, the giant 
components are the ones that will determine percolation. These random graphs 
are described in the proof of Theorem 2.2, and their composition is reformulated 
after the proof in terms of giant components alone. 
For each k > 1, let /3k € (0, 1) satisfy 

l3 k = l-e- c «ti-^\ /? = 1, (2.1) 

where CkP^-i > 1- We will show that the probability of asymptotic percolation 
-Ppcrc is given by Y[ Pk- Hence we need a condition for positivity of this product in 
terms of the Cf., which are the data of the graph Qn. This is given in the following 
lemma (proved in the Appendix). 

Lemma 2.1. Assume Ck S oo as k — > oo, c± > 2 log 2 and C2 > 8 log 2. Then 

(} k > 1/2 and c h fi\_ x > 1 for all k, (2.2) 

and 

oo oo 

JJ fa > if and only if ^ e~ Cfc < oo. (2.3) 

k=l k=l 

The main result in this paper is the following theorem. 

Theorem 2.2. Assume c k S oo as k — > oo, c x > 2 log 2, c 2 > 8 log 2. Then there 
is asymptotic percolation in Gn if and only i/X^feLi e _Cfc < oo, asymptotic perco- 
lation occurs in the form of cascade percolation, and the probability of percolation 
U.5\) is given by 



Ppcrc = 1] fa- (2.4) 



k=l 

Proof. Let us assume that percolation takes places only with connections along 
consecutive pairs of hierarchical distances k. In the last part of the proof we will 
show that this is so. 

We prove first that ^ e~ Ck — oo implies there is no percolation in G^ along con- 
secutive hierarchical distances. By the left-hand inequality in (|1.3p , the connection 
probability p N ' k given by (|1.2p satisfies 

N,k 21. 
y N 

Hence it suffices to show that there is no percolation along the random graphs 
G(N,Ck/N) at distance k for every k in the weaker sense that is in the giant 
component of the 1-ball it belongs to, this 1-ball is in the giant component of 
the 2-ball it belongs to, and so on. The probability of percolation along the 
graphs Q(N,Ck/N) is given by Il^Li 7fe (connections at different distances are 
independent), where, by (A), 7^ > satisfies 

7fe = 1 - eT Cklk (2.5) 

for each k > 1. Now, e~ Ck = oo implies ^ e_Cfc7fc = °°j since all jk < 1, 
which by (|2.5|) implies JZ(1 — 7*) = 00 > an< ^ therefore Yilk = 0. Hence there 



PERCOLATION IN A HIERARCHICAL RANDOM GRAPH 



7 



is no percolation. So, percolation along consecutive hierarchical distances implies 
e~ Cfe < oo. 

Note that if for some fc the vertex ((fc — l)-ball) in Q(N, Ck/N) containing is 
not in the giant component of the fc-ball, then by (A) the corresponding 7^ in Y[ Ik 
is 0. More precisely, if the (fc — l)-ball containing is not in the giant component 
in the fc-ball, then, by (A), is connected to at most 0(N k ~ 1 log TV) points within 
the fc-ball. But the probability of connection between a set of N k ~ 1 log iV points 
in the fc-ball and points in the (fc + l)-ball outside the fc-ball is, by (jl.ip . 

Cfc+1 x 



logiV 

~c fe+ i— — >0 as N -+00. 

Therefore there is no percolation. This observation allows us to omit non-giant 
components in the proofs. 

Now we assume J2 e ~ Ck < °°7 hence YlPk > by Lemma [2~T1 We denote by 
Qperc the probability of cascade percolation. We will show that 



n ( 2 - 6 ) 



^jpcrc — 

fc=l 

This obviously implies that there is asymptotic percolation in Qm- 
We prove first that 

00 

Qperc>n^" ( 27 ) 
fc=l 

We proceed by steps. 

For fc = 2, a given 2-ball and large N, the graph Gn,2 has N vertices, which 
are the 1-balls in the 2-ball, and the internal structure of a 1-ball is the family 
of its components. Although only the giant component G' 1 ' will be relevant for 
percolation, the other ones complete the size of the 1-ball, and this plays a role in 
the proof. By (|1.1|) . two giant components G- 1 ^ , Gj ,i 7^ j (which are at distance 2 
from each other) are connected with conditional probability (given the sizes |G^|, 



|Gf I) 



N.2 1 I 1 c 2 

Pa = 1 - 



\ \G\ L '\\G^\ 

(2.8) 



We write p^' 2 as 



where 



iV 3 / 



N.2 C201 . A N,2 . U N,2 / n \ 

Pij =^r +A ij +B v ' ^ 2 - 9 ) 



-2 (\Gf\\Gf\ 0{ 
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By CCl, A% 2 < and 

|g1 1) ||G' {1) Ix 2 ^ 2 



1 " 1 - [ N 3 J 2 ~ 2N 2 ' 



since \g\ 1) \ < N. Hence 

- S * A « 2 * °- (2 - i2) 

We take < <5 2 < 1 and delete from (?jv,2 the vertices (1-balls) with giant 
components G^ such that 

\G ( i 1) \<p 1 (l-S 2 )N, (2.13) 

where 0i satisfies (|2.1|) with k — 1, except the one that contains if it happens 
to be in this case. For a pair ij of giant components that do not satisfy (|2.13[) we 
have, by (|2~TTj) and (j2~T5j) . 

< 2 > ^((1 - 5 2 f 1) = ^(<5 2 2 - 2fe). (2.14) 
Then, from (|2.9|) . (|2.12p and ()2.14|) . for the remaining giant components we have 

Since C2/3f > 1, by Lemma 2.1, for any < e 2 < 1 such that c 2 /3^(l — e 2 ) > 1 
we can take S 2 small enough and N 2 large enough so that, from (|2.15p . 

(e 2 ) 

4' 2> V' Wh6re lt ] =C2Pl{l-e 2 )>\ 1 (2.16) 

for all N > N 2 , for all pairs ij of giant components which do not satisfy (|2 . 13|) . 

Let M N ' 2 denote the number of 1-balls of Gn,2 whose giant components satisfy 
(12~T5D . M N - 2 is distributed Bm(N,P[\G^\ < /?i(l - S 2 )N}) (the \GP\ are i.i.d.). 
Hence, for any < 77 < 1, 



M 



N,2 



N 



> V 



< -^—EM N ' 2 = -P[\G^ \<(3i(l- S 2 )N] 
Nrj r\ 

— > as N — ► 00, 



by (EH) for k = 1 and (A) for the Si? graph g(N,d/N) (see also [J, p.167). So, 
M w ' 2 = o(iV) as N -> 00 (op(JV) in the notation of g^, p. 11). Therefore, deleting 
from C?at j2 the 1-balls whose giant components satisfy (|2.13[) does not impair the 
emergence of a giant component in Gn,2, which actually emerges due to (|2. 16[) . 
The giant component of Gn.2 is a level 2 giant component, whose elements are 
1-balls with their internal structures. 

For k — 3, a given 3-ball and large N, the graph Gn,3 has N vertices, which are 
the 2-balls in the 3-ball, and the internal structure of a 2-ball is the family of its 
components, in particular the giant component of the corresponding graph Gn,2, 
as described above. Having deleted from each 2-ball an o(N) number of 1-balls as 
above, we are left only with 1-balls whose connection probabilities between their 
giant components p^' 2 satisfy (|2.16[) (and the 1-ball containing 0). We now connect 
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these 1-balls with the smaller probability q^ 2 ^ /N (recall that we are looking for 
a lower bound for (5 perc ), and we consider the resulting level 2 giant components 

Gf) - 

By (jl.ljl . two giant components G\ 2 \ G^\ i ^ j (which are at distance 3 from 
each other) are connected with probability 



where \Cf ] \ = \\Gf ] \\N (each 1-ball in G*f } contains N points), and, by (A), 

||Gf ||~/f 2) iV a.a.s., (2.18) 

where > satisfies 

/^=l- e -« e2 \ (2.19) 
Now we are in a similar situation as in the previous step {k = 2), with 1-balls 
playing the role of points and q^ 2 ^ playing the role of c\. The fact that o(N) 

1- balls have been deleted from each 2-ball has no effect, and in any case it helps 
towards a lower bound. So, proceeding as above with (|2.17p . (|2. 18[> . (|2.19|) . we 

(2) 

take < 63 < 1 and delete from Gn.3 the 2-balls with giant components G\ such 
that 

\\Gf ] \\ <pi £2) (l-S 3 )N, (2.20) 
(except the one containing 0). Choosing 63 small enough and N3 large enough (and 
larger than N2), we have that the connection probabilities p^' 3 for the remaining 
giant components satisfy 

4' 3> ^V^' where <lt £3) =cM £2) ni-^)>h (2-21) 

with small enough £3, for all N > N3. Note that since q^ 2 ^ /* c 2 Pf as £2 \ 0, 
then, form JUTS]), ff 2 2) / p\ as £ 2 \ 0, where j3 2 satisfies JUT) with k = 2, and 
since C3/3f > 1 by Lemma 2.1, then £2 and £3 can be taken small enough so that 

qt' ss) > 1 

bmce 4 E2) < I3 2 < 72 (see (22)), we have as above that the number of deleted 

2- balls from Gn,3 is o(N) as N — > 00, and we connect the remaining ones with the 
smaller probability q^ 61 ' 63 ^ /N. 

Iterating this scheme, for each k, each given fc-ball and large N, we have a 
graph Qn,r whose N vertices are the (k — l)-balls in the fc-ball with their internal 
structures, which involve the level (k — 1) giant components in the corresponding 
graphs GN,k-i- A number o(N) of vertices of QN,k are deleted, as above, the 
remaining ones are connected with the smaller probability q^ 2 ''"' ek ^ /N , where 

q (s 2 ,...,e k ) = cfc(/3 ( e _ 2 ,...,e fc _ l))2(1 _ £k) > ^ (2 22) 

and p[ £2 -' £k) > satisfies 

^••- e *> = l- e -«ir a ek) ti 2 efe> . (2.23) 
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Clearly, because of the deletions we have made and the use of smaller connection 
probabilities, we have 



?pcrc > 01 J] Pk 
k=l 



(£2, •••,£&) 



(2.24) 



(connections at different distances are independent). 

Letting e 2 ,£fc, .. . \ (in this order), we have from ([2~22l and ([2~23"f : q!f 2) / 
c 2 /3 2 , hence /3^ 2) / /3 2 , where f3 2 satisfies (J2J]) for k = 2, hence q ( 3 E2 ' E3) / c 3 /3^ 
hence /3 3 E2 ' e3 ^ /* (3 3l where (3 3 satisfies (|2.1[) for k = 3, and so on. Therefore, we 
obtain j2J|) from (f2T24|) . 

Note that the cascade of giant components containing will always remain, and 
in any case it would not be deleted according to (|2.13j) . (I2.20|) . etc., a.a.s.. 

We now prove that 

oo 

Qpcrc<n^- ( 2 ' 25 ) 

fc=l 

The approach is analogous to the one used for proving (|2.7p . but now we replace 
the random connection probabilities between giant components by upper bounds. 
Since non-giant components in fc-balls have no effect on percolation, we consider 
only the connections between giant components. 

For k = 2, from (J2J]), (|2~TTj) . (j2~T2|) . and (A) for the ER graph Q(N,c x /N) we 
have 



N,2 . 
Pin < 



N 



N 



N 



N 3 



N 2 



N 
-Pi 



< ^(P! + S 2 ) a.a.s. 
for any S 2 > 0. Now we connect the 1-balls in a 2-ball with probability c 2 (j3 2 + 

(2) 

5 2 )/N, and we consider the resulting level 2 giant components G\ in 2-balls 
(we keep the notation G\ used before, but now the connection probabilities are 
different). 

(2) (2) 

For k — 3, two giant components G\ ,Gj , i ^ j, in a 3-ball are connected, by 
(jl.ip . as above, with probability 



N,3 



= 1-1- 



C3 

N 5 



iC2)||^(3)| 



where \G 



(2) I - 11/3(2) 



G\ \\N, and, by (A), 



||G 



(2) 



/3^ 2) Af a.a.s., 



where > satisfies 



1 — e ^ 2 2 



(2.26) 

(2.27) 
(2.28) 
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with 



<?f 3 =c 2 (/3?+J 2 ) 
Proceeding the same way as above, it follows from (|2.26[) - (I2.29|) that 



(2.29) 



N,a ^ ci\p 2 ) , 

Pij' < ~ T + c -3 



TV 



\G\ 2) \\Gf | (fi^f 



N 



£3 

N 



N 5 

,(2) m ,^(2) 



N 2 



< |((^ 2) ) 2 + <5 3 ) a.a.s. 



for any ^3 > 0. 

Iteration of this scheme leads to 



< 



(5 2 ,...,5 k ) 
k ' 



fc=l 



iV 



IIOIIICHI 



(2.30) 



and ; 



\ /3fe as <5s, . . . \ 0, where fa satisfies (|2.1j) . 
We then obtain (|2T2"5) from (|2T5D]l . 
Since Q pe rc refers to a special form of percolation, we have Q por c < -Ppcrc- ln 
order to see that cascade percolation is equivalent to asymptotic percolation, and 
therefore <5 pC rc = -Pperc, which proves (|2.4[) by (|2.6p . it remains only to show that, 
as we assumed at the beginning of the proof, for each k all 1-step connections from 
a fc-ball are in the (k + l)-ball it belongs to a.a.s.. This is done later on in Lemma 
2.5. 

□ 

In the proof of Theorem 2.2 the random graph Qnm has N vertices which are 
the (k — l)-balls in a fc-ball, and each (A; — l)-ball has an internal structure which 
contains the giant and the other components in it. We replaced the connection 
probabilities between {k— l)-balls by the random connection probabilities between 
their giant components because they determine percolation, and then we replaced 
these random connection probabilities by upper and lower (deterministic) con- 
nection probabilities in order to obtain upper and lower bounds for Q pcrc . The 
non-giant components have no part in percolation, but still each (fc — l)-ball con- 
tained all its N (k — 2)-balls (with their internal structures), and this was relevant 
in the proof of the theorem (see Remark 2.4). The random graphs Gn,U can be 
redefined so that their vertices contain only giant components at previous lev- 
els, with random connection probabilities between the vertices. In this sense the 
graphs Qn,u are "doubly stochastic" . This way the cascade of giant components 
in a k-ball is more clearly depicted: all the points of f2jv in a vertex of Qn,u are 
connected, and for each j = 2, . . . , k, the giant component at level j consists of 
connected giant components at level j — 1. We denote by C N,k the size of the 
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cascade of giant components in a fc-ball (i.e., the number of points of Oat in the 
cascade). The growth of C N,k is given in the following corollary. 

Corollary 2.3 

C N ' k ~ (f[/3j)N k a.a.s., (2.31) 

where the f3j satisfy (2.1)) . 

Proof. For k = 1 and a giant component G' 1 ' of a 1-ball, we have 

|G (1) |-/3iiV a.a.s., (2.32) 

by (EU) and (A). 

For k — 2 and a level 2 giant component G' 2 ' (with the true connection prob- 
abilities, not upper and lower bounds as before) in a 2-ball, we have similarly, by 
(A), on the one hand, 

\\G (2) \\>P { 2 2) N a.a.s., (2.33) 

where satisfies (|2.19[) . and on the other hand, 

l|G (2) || <13 ( 2 &2) N a.a.s, (2.34) 

where f^ 2 ^ satisfies (|2.28[) . Since £2 and 82 are both arbitrarily small, we have 
from (|2~§3l) and (j2~34)l . 

\\G {2) \\~(3 2 N a.a.s.. (2.35) 

Now, each vertex of contains ~ (3\N points a.a.s. (not all N points, as in 
the proof of Theorem 2.2), therefore, from (|2.35[) , 

\G {2) \ ~ (3if3 2 N 2 a.a.s.. 

By induction we obtain (|2.3ip . □ 



Remark 2.4 We could consider the random graphs Qn.u with giant compo- 
nents alone in the proof of Theorem 2.2, but then, instead of (jl.ip . which is 
c k /N(N k - v ) 2 , by Corollary 2.3 we would use connection probabilities 

Ck 

between points x and y such that c?(x, y) = k, in order to restrict the internal 
structures to the cascades of giant component in the (k — l)-balls containing the 
points. The approach we have taken is more natural. 

To complete the proof of Theorem 2.2 it remains to prove the following lemma. 



Lemma 2.5 For each k > 0, all external 1-step connections from a given k-ball 
are at distance k+1 from it a.a.s. 

The proof of Lemma 2.5 is a special case of the following calculations, which are 
intended to give some idea of what percolation paths might look like as N — > 00. 

Let F( fc ) denote the number of 1-step connections from G iljv to points at 
hierarchical distance k > 1 (or from any given point of Oat to points at distance 
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k from it; recall that d(-, •) is translation- invariant). Since Y^ k ' is distributed 
Bin(iV fc - N k - 1 ,c k /N 2k - 1 ) 1 then 

Pois(ci) as iV — > oo , (2.36) 

and 

Y (k) ^ as N -*• oo, A; > 1. (2.37) 

In the case fe > 1, 

Cfc 

ryi v '' > uj ~ - 

and 



P[Y (k) > 0] ~ j^rr as JV - oo, (2.38) 



p[y( fc ) = n |y(*) > 0] -»• I 

Indeed, using 1 — a; ~ — logx as x — > 1, 

> o] = 1 - ( 1 - 



1 77 = 1 

„' as iV^oo. (2.39) 

0, 77, > 1 V ' 



JV fe — N k ~ 1 

Ck 



_/y2fc-i 



-(JV fc — iV fc — 1 )log( 1 



and now using xlog(l — a/x) ~ —a as an oo, a > 0, 

jv^pryW > o] ~ -(7v 2fc -! -7v 2fe - 2 )io g ^i 



M 2k ~ 1 



~ c fc , 

which proves ()2.38|) . For the proof of (|2.39[1 we have 

Pl yW =n \YW>0 ] = P jy^=J ] , 

and 



(N k -N k ~ l )\ ( Cfc 



N k -N k 



Ck 



n\(N k - N 1 *- 1 - n)\ ^N 2 ^ 1 

hence, by ([2~38]) . 

p[y(*0 = 77] {N k - iV fe - 1 )! A/*" 1 

pjyw > 0] ~ n\(N k - n*- 1 - n)\ cfc V^ 2fe_1 

l, n = l , T 

, as iv — > 00. 

0, 71 > 1 

The results (|2.36[) - (I2.39I) show that a.a.s. almost all 1-step connections from 
will be at distance 1 and the degree of 0, and of any point, will be distributed 
Pois(ci). Also, for each k > 1 the probability of 1-step connections from to 
points at distance k is negligible w.r.t. the probability of connections to points at 
distance k — 1, and given that it has connections at distance k, there is only one 
such connection, as N — > 00. In a similar way it can be shown that the number of 
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points in the giant component of any 1-ball which have external 1-step connections 

at distance 2 from it is asymptotically Pois(c2/3i) as N — > oo. 

Similarly, let Y^ k '^ denote the number of 1-step connections from a given fc- 

ball to fc-balls at distance j > fc + 1 from it. is distributed Bin(JV J ' -k — 

N j-l-k )p N,k,j^ where 

is the probability of 1-step connections between two fc-balls at distance j from each 
other. 

In the same way as above we have 

J^-j^SpT ^ tf-oo, (2.41) 

and it follows that 

y(fe,fc+i) ^ Pois(c fc+1 ) as N^oo, (2.42) 
r (fej-) =>■ as N -> co, j>jfe+l, (2.43) 



and for j > fc + 1 , 



P[y(*^ > 0] ~ Ar .g fe _ 1 as N > co, (2.44) 



P [y(fcj) _ „|y(**0 > o] J n 1 as TV -» oo. (2.45) 



and 

p\v(k,j) _ „iv(*.J') ^ nl _^ J 

0, n > 1 

Analogous results can be obtained, using (A), for the number of 1-step connec- 
tions between a giant component in a fc-ball and other such giant components at 
distance j > k + 1 from it, with Cj replaced by Cj0l in (|2llj) . ([27S2]) . (f2T44|l . It 
can also be shown that the number of points in the cascade of giant components 
in a fc-ball with external 1-step connections at distance fc + 1 is asymptotically 
Pois(c fc+ i JXf = i Pj) as N -> oo. 

From (f2T55]l - (|23g]l and l2T4"2|) - (|5^5|) . we see that the same pattern described 
above for connections between individual points is repeated for connections be- 
tween giant components of giant components at the previous level in fc-balls for 
all fc. In particular, for each fc all the external 1-step connections from the giant 
component in a given fc-ball are at distance fc + 1 from it a.a.s. 

Note that these arguments also prove Lemma 2.5. 

With these results one can also see that connections between (fc — l)-balls in a 
fc-ball through points outside the fc-ball have probability o(l/N), as stated in the 
Introduction. 

These results suggest the following picture for the most likely form of cascade 
percolation paths: is in the giant component of the 1-ball it belongs to. A 
Pois(c2/3i) number of points in this giant component have external single 1-step 
connections to points at distance 2 in such a way that the giant component is 
connected to the giant components of the other 1-balls, forming the level 2 giant 
component of the 2-ball they belong to. This level 2 giant component has a Pois 
{C3P1P2) number of points with external single 1-step connections to other level 2 
giant components in the 3-ball they belong to, forming a level 3 giant component. 
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And so on along the cascade of giant components for every distance k. Each path 
starting from and going to infinity this way is a percolation path. 

Example. Let = alogfc for large k, a > 0. Then there is asymptotic percola- 
tion if and only if a > 1. 



Remarks 2.6 

1. Each of the following is a sufficient condition for percolation: 



c k 5 < oo for some S > 0, liminf Ck/k > 0, liminf(cfc+i — c^) > 0. 

^ — ' k — >oc k — >oo 

k=l 

2. The degree sequence of ER random graphs has an approximate Poisson dis- 
tribution. On the other hand, the random graphs of [2] have scale-free degree 
distributions. A rigorous study of scale- free random graphs, including a coupling 
with ER random graphs, is contained in [ITJ[l2]. We will see that in the graphs 
Q{N,p N ' k ) the degrees in a neighborhood of Ck have an approximate scale-free 
behavior with exponent 1/2, for large k. 

Recall that in the random graph G(N,p N,k ) the vertices are the N (k— l)-balls 
in a fc-ball with p N,k given by (|1.2[) . Neglecting the term o(l/N) in (|1.4[) , we 
consider G(N, Ck/N). 

Let Xj denote the number of vertices in Q(N, Ck/N) of degree j = 0, 1, Then 

the proportion Xj/N has an approximate Poisson distribution [11] (Theorem 3), 
i.e., for each j and any < e < 1, 



P 



(1-e) 



Js < —L < (i -L e \Ji 



1 as N 



(2.46) 



Let Bk — {j : \j — Cfc| < M} for some constant M. Recall that Ck — » oo as 
k — > oo. Using the Stirling formula (in the form in |19j . p. 52), and the standard 
inequalities 



1 



*y< e *<fi-* 

yj V y 

it is easy to show that for any < S < 1, 



< x < y, y > 1, 



(1-6)- 



1 , r> p- c k 

1 —1/2 < L fc e 



/2tt 



J 



< 



-1/2 



(2.47) 



for all j € Bk and sufficiently large k. 

Combining (|2.46|) and (|2.47|) we have, for any < e < 1, 



P 



(1-e) 



-1/2 



1/2 



<§<(i + £)^= 
2tt " N - v 



1 as A — > oo 



(2.48) 



for all j G -Bfc and sufficiently large fc. 



3. Some open problems 
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3.1. A class of doubly stochastic random graphs. The essence of the proof of 
Theorem 2.2 is that the random dependent connection probabilities p^' k between 
giant components of (fc — l)-balls in a fc-ball obey 

N,k c k0k-l a 2 . , ,n ,1 

p v AT — a ' a - s -' Cfe ^-i > 1 ' t 3 - 1 ) 



in the sense that 



(l-e) <p ^< ^9^(1 + e) (3.2) 



a.a.s. for each fc and arbitrary e (such that Cfc J 8^_ 1 (l — e) > 1). Therefore as- 
ymptotic percolation in C/jv is as if it took place along the giant components of 
the ER random graphs Q(N,Ck0%_i/N) instead of Q(N,p N,k ), considering only 
connections between (k — l)-balls. 

This suggests the study of "doubly stochastic" random graphs of the form 
£7(-N){Py}*,j=i,...,iv) with vertices {vi, . . . , vn} and connection probabilities p™ 
for the vertices Vi and Vj, where the pfj are random variables which may be de- 
pendent, such that pfj ~ c/N, c > 1, for all ij a.a.s.. A natural question is which 
results of the theory of ER random graphs Q (N, c/N) can be extended to this more 
general class of random graphs. 

The usefulness of such extended results is illustrated in the following two prob- 
lems, which could be treated if they were available. 

3.2. Average distance in the cascade of giant components in a fc-ball. 

Let D(x,y) denote the (graph) distance between two points x and y (i.e. the 
length of the shortest path connecting the two points) chosen at random in the 
cascade of giant components in a given fc-ball, k > 1. If the two points are at 
hierarchical distance 1 from each other (i.e., they belong to the giant component 
of the same 1-ball), then D(x,y) grows like logiV/logci as N — > oo, according 
to Theorem 2.4.1 in [T7] and (1.1). If the hierarchical distance between the two 
points is larger, say fc (i.e., they are in different (k — l)-balls; note that this is the 
most likely outcome of the random choice of points, with probability 1 — 1/JV), 
then the expected result would be that D(x,y) grows like 

(logN) k 



as N — > oo, 



where the 0j-\ satisfy (|2.1[) and (|2.2[) . provided that Theorem 2.4.1 in [T7] could be 
extended to the random graphs with random connection probabilities p^' k obeying 

Heuristically, the reasoning is as follows. Starting from a given point in the 
giant component of a 1-ball, the shortest path to one of the points in the 1- 
ball that have external connections at distance 2 grows like log N/ log c\ (these 
points are chosen at random in the giant component of the 1-ball and all the 
shortest paths grow the same way). This happens in every 1-ball. Next, in the 
giant component in the 2-ball the shortest path from a given level 1 giant compo- 
nent to one of the level 1 giant components that have connections at distance 3 
should grow like log AT/ log(c2/3f), according to (|3.1|) . if the extended result holds. 
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Hence the (graph) distance between two points chosen at random in the level 2 
giant component of level 1 giant components in a 2-ball grows like the product 
(logN/ Cl )(logN/log(c 2 ^)) = (logAOVn^iM^-i). Etc. 

In contrast, the average distance in the giant component of the ER random 
graph Q(N k ,c/N k ),c > 1, grows like fc log N/ log c. Hence, although the sizes of 
the giant clusters in a fc-ball of f2jv (which has N k points) and in the ER graph 
g(N k ,c/N k ) are both of order N k (by (A) and Corollary 2.3), the typical length of 
paths is longer in the hierarchical case (of order (logiV) fc ). This happens because 
two points chosen at random in the cascade in a fc-ball are most likely hierarchical 
distance fc apart, and the paths connecting them have to go consecutively through 
fc hierarchical levels (by cascade percolation). 



3.3. Central limit theorem for the size of the cascade of giant compo- 
nents in a fc-ball. The following central limit theorem is proved in [4] for the 
fluctuation of the size \G | of the giant component G N of a ER random graph 
G(N,c/N),c>l: 

\° N \-P N >M( , a 2 ) as iV^oo, 



ATl/2 

where => denotes convergence in distribution, and 7V(0, a 2 ) is the normal distribu- 
tion with mean and variance a 2 = /3(1— /3)//i 2 , where j3 > satisfies 1— (3 = e~ c @ , 
and —fi is the slope of 1 — t — e~ ct at t = f3 (see also [17], Theorem 2.5.3). 

If this result could be extended to random graphs with random connection 
probabilities obeying (|3.1|) . we would have the following central limit result. Recall 
that C N ' k denotes the size of the cascade of giant components in a fc-ball (see 
Corollary 2.3). Let 

A - ' 1, Z, . . . . 

Then 

x N.k ^ (j[ ^ ag N ^ ^ 

for each fc, where /3fe > satisfies (|2.1[) . a\ = /3fc(l — /9fe)//i|, and — /Xfc is the slope 
of 1 -t -er c ^l-^ at t = (3 k . 

We argue by induction. For fc = 1 the result is obtained directly by the c.l.t. 
of [1]. Now we assume the result is true for fc — 1 and consider the case fc. Note 

that C N ' k — Y^}j=i " Cj V ' fe_1 , where ||G Ar,,c || is the size of the giant component 
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otg{N,CkPi_ 1 IN) 1 hence \\G N ' k \\ grows like /3 k N. We write X N ^ k as 

^ j=i \j=i / 

- i e ^ ^'- n^'- 
n ft )&^, ( 3. 3) 

2 — 1 ' 

where C^' _1 , j = 1, . . . , | |G JV,fc | |, are the sizes of the cascades in the (k — l)-balls 

forming the giant component of Q(N,Ck/3^_ 1 /N). The ' _1 are i.i.d, and by 
the induction assumption, 

1 fc - 1 - fn a)^*- 1 ) ^(o, (n &) Vi) ( 3 - 4 ) 



N (k-X)-l/1 i J 



for each j, as — > oo. It is easy to show from (|3.4[) that the first term on the 
r.h.s. of (|3.3p converges to in probability as N — > oo. The result then follows 
from the second term in the r.h.s. of (13.31). if the extended central limit theorem 



for the random graph with connection probabilities pf* obeying (|3. 1|) holds. 
Appendix 

Proof of Lemma 2.1 (|2~Tj) with fc = 1 and c x > 2 log 2 imply ft > 1/2. Then 
c 2 /3j > 2 log2, which by (|2~Tj) with fc = 2 implies ft > 1/2, and so on, so ft > 1/2 
for all k. 

From (|2.1|) we have 

1-ft = exp{-c fc /32_i(l-e- c ^-^ fe )} 

(using ft > 1/2 and xe~ x < e _1 ) 

where C = e 2e . Then, by (|2.1|> and Cfc > Cfc_i, 

1-ft < Ccxp{-c fe _i(l - e" *-^*-^*- 1 ) 2 } 

< C*cxp{-c fe _i(l - 2e- Cfc - 1 ^- 2/3fc - 1 )} 

< Ce"^- 1 exp{2cfc_ie~ Cfc - l/8 } 

< c ie - Ck -\ 

where C*i = Ce 16e ~\ 

Hence e ~ Ck < oo implies XX 1 — Pk) < 00 1 an d therefore J} ft > 0. 

The reverse inequality is clear, since all ft < 1. □ 
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